EIGENVALUES OF SUMS OF PSEUDO-HERMITIAN MATRICES 



PHILIP FOTH 

Abstract. We study analogues of classical inequalities for the eigenvalues of sums of 
pscudo-Hermitian matrices. 



1. Introduction 

The classical triangle inequality says that for a triangle with side lengths a, b and c, 
one has \a — b\ < c < \a + b\. If one considers the space M. 3 with the Minkowski norm 
\(x, y, z)\ 2 = z 2 — x 2 — y 2 , then in the future timelike cone, defined by z 2 — x 2 — y 2 > 0, 
z > 0, the triangle inequality gets reversed, and the sides of a triangle a + b = c satisfy 
|c|>|a| + |6|. This can be interpreted in terms of 2 x 2 traceless pseudo-Hermitian 
matrices, if one puts into correspondence to a vector with coordinates (x,y, z) the matrix 

( z x + T ■ y 

\ —x + >/— T • y —z 

The eigenvalues of this matrix are ±a/z 2 — x 2 — y 2 and therefore the Minkowski triangle 
inequality answers the following question: given two traceless pseudo-Hermitian matrices 
with real spectra (a, —a) and (b, —b) and non-negative upper-left entries, what are the 
possible eigenvalues of their sum? Explorations of this and related questions for Hermit- 
ian symmetric matrices (and more generally for triangles in dual vector spaces of compact 
Lie algebras) led to many exciting developments bridging across algebra, Lie theory, rep- 
resentation theory, symplectic geometry, geometric invariant theory, vector bundles, and 
combinatorics, see for example, [3], [6] and references therein. A brief answer to this 
question can be formulated as follows: given two Hermitian symmetric matrices A and 
B, the set of eigenvalues for their sum A + B necessarily belongs to a convex polytope 
defined by certain linear inequalities on the sets of eigenvalues of A and B. 

In the present paper, we begin answering a similar question in the non-compact set- 
ting. Let G = U(p, q) be the pseudounitary Lie group, g its Lie algebra, and g* the 
dual vector space identified with the space of pseudo-Hermitian matrices A, defined by 
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the condition A = J pq A*J pq , where J pq = diag(l, 1, — 1, — 1) and A* is the conju- 

p g 

gate transpose. In general, eigenvalues of pseudo-Hermitian matrices are not necessarily 
real, unless A is elliptic. And moreover, the eigenvalues of the sum of even two elliptic 
elements can be pretty much arbitrary complex numbers. However, if one restricts to 
the convex cone of admissible elements [7j , then the question about possible eigenvalues 
of the sum becomes more meaningful. In our situation, the convex cone of admissible 
elements g* dm will consist of matrices, which are G-conjugate to diagonal (and thus real) 
matrices diag(Ai, X p , fj,i, fi q ) such that Aj > fij for all pairs We can certainly 
assume that A's are arranged in the non-increasing order X\ < X 2 < ■ ■ ■ < X p and /u's are 
in the non- decreasing order [i\ > ^ > ■ ■ • > [i q (this is done for convenience), and thus 
the condition of admissibility becomes rather simple: \\> n\. 

For two admissible matrices A, B G fl^dm w hh given spectra, the question of finding 
possible eigenvalues of their sum can be formulated in terms of the non-abelian convexity 
theorem in symplectic geometry. The coadjoint orbits Oa and Ob of A and B carry 
natural invariant symplectic structures and so does their product Oa x Ob- A general- 
ization due to Weinstein [9] of the original Kirwan's theorem to the case of non-compact 
semisimple groups implies that the possible spectrum of A + B forms a convex polyhedral 
set in the positive Weyl chamber i* + of the dual space to the diagonal torus. 

The primary purpose of this note is to reveal some of the defining conditions on this set, 
in particulr obtaining an analogue of classical Lidskii-Wielandt inequalities [10J. Let us 
formulate our result and explain its geometric meaning. For A,B e g* dm and C = A + B, 
let Xi(A), fij(A), Xi(B), fJ.j(B), Aj(C), (J.j(C) be their eigenvalues in the order as above. 
Then for each m integers 1 < i\ < i% < ■ ■ ■ < i m < p and £ integers 1 < ji < j% < ■ ■ ■ < 
je < <? we have 

m m m 

k=l k=l k=l 

and 

t i z 

k=l k=l k=l 

Of course, in addition, we have the trace condition: 

E ^ + E = E X ^ A ) + E ^M) + E A *( fl ) + E ■ 

1=1 j=l i=l j=l i=l j=l 

We also state a more general analogue of Thompson- Freede inequalities [8] . Recall from 
Theorem VIII. 1.19] that the set of possible diagonal entries of an admissible matrix 
A with eigenvalues ( A , ~jt) as above, form a convex polyhedral set Sa, which can be 
described as the sum II + C of a polytope II and a cone C. The polytope IT is the convex 
hull of S p X S q .( X , fi) - so its vertices are obtained by the action of the Weyl group for 
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the maximal compact subgroup (the product of two symmetric groups in our case). The 
cone C is given by the non-compact roots, which in our case means that it is the R + -span 
of the diagonal differences au — ajj for 1 < i < p and p+ 1 < j < n. The above inequalities 
have then the following geometric interpretation: possible eigenvalues of A + B belong to 
the convex polyhedral region ( A (A), ~~jl(A)) + Sb (of course, due to symmetry, we can 
interchange A and B and get another set of conditions). 

In this note we only deal with analogues of classical eigenvalue inequalities, leaving 
out natural questions of relationship with tensor products of representations of G and 
combinatorics. 



2. COURANT-FlSCHER THEOREM FOR PSEUDO-HERMITIAN MATRICES 

Let p and q be non-negative integers, n = p+q, and let G = U(p, q) be the pseudounitary 
group of n x n matrices M, satisfying MJ pq M* = J pq , where J pq is the diagonal matrix 

l p A 

o -i J ' 

Let g = u(p, q) be its Lie algebra of matrices B, satisfying BJ pq + J pq B* = and let g* 
be its dual vector space, which is identified with the space T ■ of pseudo-Hermitian 
matrices A, satisfying AJ pq = J pq A* . In the block form, 

B ) , 

where H p and H q are p x p and q x q Hermitian symmetric matrices respectively and 
B is a complex p x q matrix. Let 0* dm denote a convex component of the open cone of 
admissible elements, in the terminology of [7J. In general, an element A G Q* is said to 
be admissible if the co-adjoint orbit Oa is closed and its convex hull contains no lines. 
In the pseudounitary case, this translates to the requirement that the coadjoint orbit of 
A contains a diagonal matrix A = diag(A p , \\,H\, ••,/ i g) ) where A p > A p _i > • • • > Ai, 
> )U 2 > ■ - - > and either Ai > fii, or fi q > X p . There are two open cone components, 
and without loss of generality we choose g* dm to be the component in which Ai > fix. 

Let us consider the complex vector space C n with the pseudo-Hermitian pairing of 
signature (p, q) : 

p n 

(Z,W) = ^ZiWi - ^ Z 3™3 ■ 
i=l j=p+l 

If we introduce the notation 

then we can rewrite the above pairing in terms of the usual product: 

(z, w) = • z . 
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Let us also denote by Of the open cone of positive vectors, satisfying (z, z) > 0, and 
similarly by C™ the cone of negative vectors. Our condition that A is admissible is 
equivalent to saying that it has real eigenvalues, and the p eigenvalues corresponding to 
the eigenvectors in are larger than the q eigenvalues corresponding to the eigenvectors 
in C\ 

Now we shall examine an appropriate analogue of the Rayleigh-Ritz ratio, defined as 



TZa fx 



x"l"x 

Lemma 2.1. Let A G 0adm have the eigenvalues 

(2.1) X p > A p _i > • • • > Ai > /Ui > \i-2 > ■ ■ ■ > \i q . 

Then one has 

Ai = min TZaM and ui = maxT^fx) . 

xec" xec™ 



Proof. Let U G G be such a matrix that A = UAU 1 , where A, as before, is the diagonal 
matrix A = diag(A p , Ai, fi\, .., /i g ). Note that U' 1 = W, where W = J pq U*J pq . Since 
U G G, the group of linear transformations of C n , preserving the pairing (z, w), its action 
on C n preserves and C™. For x G C™, denote y = C/'x, y G C™. Since x^x = yty > 
and (Wx.y = x^U, we have 

x f ylx y f Ay 



Then we need to show that 



xtx yty 



y f Ay > Aiy f y, 

which trivially follows from (12.11) . 

The second statement for /ii follows from the statement for Ai, by changing A to —A. 
Q.E.D. 

Next, let vi, .., v p , wi, w g be a basis of eigenvectors of A in C n , corresponidng to 
the eigenvalues Ai, A p , fix, \i q respectively and orthonormal with respect to (•,•). 
In particular, we have that ||vj|| 2 = 1, ||wj|| 2 = —1 and the pairing of any two different 
vectors from this basis equals zero. Let also, for convenience, denote V = Span{vi, v p } 
and W = Spanjwi, ...,w g }. Note that for 

x = ttiVx H h a p v p + ftwi H (3 q w q 

the quotient 1Za{^) can be written as 



K A ht) 



cUx ELil«ii 2 Ai-E'=il^l>,- 



ELiN 2 -El=ilA-l 
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From the previous Lemma and the fact that (-, ■) restricts to a positive definite Her- 
mitian pairing on the subspace V, which is orthogonal to W with respect to (•,•), we 
deduce: 

Lemma 2.2. 

(2.2) Afc = min TZa{x) and Afc = max TZa(x) ■ 

xeC^, x±vi,...,v fc _i x6V\{0}, xlv H1 ,...,v p 

A similar statement is, of course, valid for //fc's: 

Hk = max TZa{x) and = min 1Za(x) ■ 

xGC™, x±wi,...,Wfc_i ' xeVK\{0}, x±w fe+1 ,...,w 9 

Now we are ready to state and prove a result, similar to the classical Courant-Fischer 
theorem. 

Theorem 2.3. Let A G 0^ m be an admissible pseudo-Hermitian matrix with eigenvalues 
as in ( QOp . Let k be an integer, 1 < k < p. Then 

(2.3) Afc = min max TZa(x) 

ui,...,u n _ fc ec n xec™, xiui,...,u„_j. 

(2.4) Afc = max min 1Za{x) 

ui,...,u fc _ 1 GC n xeCV, x-Lui,...,Ufc_i 



Proof. Our line of proof follows the standard argument for the classical Courant-Fischer 
theorem [5]. We will only consider f)2.3p . as the second equality is similar. As in Lemma 
12.11 let y = Wx., where A = UAU -1 , and A = diag(A p , Aj, Hi, .., fi g ). Then 

sup Ha(x) = sup ^-A(y) 

X6C™, x±ui,...,u n _ fe y eC™, yJJ7tui,... ) trtu n _ fc 

> sup ft A (y) ^ x k ■ 

yeC^, y±C/t Ul ,...,C/tu„_ fe , 2/p _ fc+1 =...= 2/p =0 

But (I2.2p shows that the equality holds if we take = Wj for 1 < z < q and Uj = Vfc_ g+i 
for g + l<z<n — fc. Thus 

Afc = min sup TZa(x), 

ui,...,u n _ fc eC" xeC ^ x ±ui,..,u„_ fc 

and (i2Ti|) is similar. Q.E.D. 

Note that, in general, the ratio TZa{x) is not bounded from above on C™ . Therefore in 
the right hand side of the formula (12.31) . the maximum should be taken over the (n — k)- 
tuples of vectors for which it is actually achieved, and otherwise one might want to use 
sup instead of max. 

The above theorem obviously has a natural counterpart, consisting of two series of 
minimax and maximin identities, for /ifc's. We omit stating and proving those, since it 
can easily be done if one replaces A by its negative. 
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It is also worth noticing that one can rewrite the equality (12.31) in the following form: 

(2.5) Afc = min max "^(x) , 

w k xec^, xew fc 

where W k is a subspace of dimension k, which in fact can be taken entirely lying in C™ 
(with the exception of the origin, of course). 

Next, we state a result similar to one found in pQ. We will omit the proof since it is a 
repetition of a standard argument: 

Proposition 2.4. For an admissible pseudo-Hermitian matrix A as above, and a positive 
integer k < p, one has 

k 

Ai + A 2 H hA fc = min y^^(xj) . 

(x ? ;,Xj) — Sij 

1=1 



Note that the condition (xj,Xj) = 5ij automatically implies that all of the x^'s belong 
to CI. 



As another easy corollary to Theorem 12. 3[ we have the following analogue of classical 
Weyl inequalities: 

Proposition 2.5. Let A, B 6 g* dm and let \{A), ^j(A), Xi{B), fij(B), Xi(A + B), 
/j,j(A + B) be the eigenvalues of A, B, and A + B arranged in the order as in Ii2.1\) . Then 
for each 1 < k < p and 1 < I < q we have: 

X k {A + B)>X k (A)+X 1 (B) and H {A + B) < fj, t {A) + m{B) . 

Proof. We will only prove the first inequality, as the second is similar. We know that for 
each x e C", one has 1Zb{*) > X\(B). Hence, using the linearity property of the ratio 
7^a+b( x ) = 7^4( x ) + 7^b( x ), for 1 < A; < p we have 

Xk(A + B)= min max 1Za+b{x) 

ui,...,u n _ fe eo xecj, xiui,...,u„_ fc 



^ min r „ r „ ™ ax (Tl A (x) + X^B)) = X k (A) + X 1 (B) . 

ui,...,u n _ fc eC n XGC2., x±ui,...,u n _ fc 



Q.E.D. 



3. LlDSKII-WlELAND AND THOMPSON-FREEDE TYPE INEQUALITIES 

In this section we will establish stronger inequalities for the eigenvalues of the sum of 
two admissible pseudo-Hermitian matrices. The first goal of this section is to prove the 
following 
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Theorem 3.1. Let A, B e fl* dm and let Xi{A), Hj{A), \{B), (J.j(B), A;(C% fij{C) be the 
eigenvalues of A, B, and C = A + B arranged in the order as in ( fj8.jp . Then for each m 
integers 1 < i\ < i<i < ■ ■ ■ < i rn < p and £ integers 1 < ji < j2 < • • • < ji < q we have 

m m m 

(3-1) XX^E^ 4 ) + 

k=l k=l k=l 

and 

iti 
(3-2) X>j*(^<X>i*(4) + £/**(B) • 

k=l k=l k=l 



In what follows, we will only work on proving (13.11) . as (13.11) is similar. 

For m < p, us have a fixed m-tuple of integers 1 < i% < %i < ■ ■ ■ < i m < p. Consider a 
flag of subspaces C Vi 2 C ■ • • C Vi m , where VL \ {0} C C" and the subscript indicates 
the dimension of the corresponding subspace. We say that an orthogonal set of vectors 
{x ii; Xj 2 , x im } is subordinate to this flag, if x,. £ V*. and (x^,x ife ) = 8 jk . 

Denote by P m the projection operator onto the Y = Spanjx^, Xj 2 , x im }. Here the 
projection is taken with respect to (•,•), and is therefore given by the matrix XX^, where 
the j-the column of X is Xj.. For any A £ g*, the operator P m AP m is also pseudo- 
Hermitian, but its restriction to Y is actually Hermitian, and we let rji < rj2 < ■ • • < i] m 
denote the set of its eigenvalues. We have the following analogue of a classical result of 
Wielandt fTU]: 



Lemma 3.2. For A £ g* dm with eigenvalues as in (OOP; and rji 's as above, we have 

m m 

> Aj = min max > t)a . 

3 v H cv t2 c-cv lm x &/ ^ 
3=1 j=i 



We postpone proving this rather technical lemma till the next secion, and now state an 
easy corollary: 

Proposition 3.3. For A £ g^dm eigenvalues as in \2. jp and an m-tuple of integers 
1 < ii < i 2 < • ■ • < *m < P, one has 



(3.3) ^TA 4j .= min max^T^x,.) 

3=1 3=1 



Proof. One can easily see that the right-hand side of (13.31) is exactly the trace of the 
Hermitian operator P m AP m acting on the space Y, because 
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and as such, equals Y^jLiVj- Q-E.D. 

Now we can establish an analogue of Lidskii-Wieland inequalities. 

Proof of Theorem \3.1[ For a given m-tuple of integers 1 < i\ < i 2 < ■ ■ ■ < i m < P, let us 
choose a flag of subspaces ^ C C ■■• C Vi m in C™ so that for any orthogonal set of 
vectors {x^, Xj 2 , x im } subordinate to this flag, one has 

m m 

i=i i=i 

As Proposition 13.31 shows, this is always possible. Now note that 

m m m 

E^K) = E^K) + E^K) ' 

j=i j=i j=i 

and use Proposition 13.31 once again to choose an orthogonal set of vectors {x^, x i2 , 
x im } subordinate to the flag C Vi 2 C ■ ■ • C Vi m such that 

m m 

J>^.)>5> 3 .(A). 
3=1 i=i 
Next, note that Proposition 12.41 implies that 

m m 

3=1 3=1 

and the result follows. Q.E.D. 

We now state an analogue of Thompson- Freede inequalities [8] (without proof). Let us 
have two m-tuples of integers 1 < i\ < i<i < ■ ■ ■ < i m < p and 1 < ji < j 2 < ■ ■ ■ < j m < V 
such that i m + j m < m + p. Then 

m mm 

E K +3h - h (c) > £ X ih (A) + E A ^( s ) • 

/i=l h=l h=l 

A similar inequality can be stated for //'s as well. 

4. Proof of Lemma 13721 

Following the standard path of proving such results as outlined, for example, in the 
Appendix by B.V. Lidskii to [4], the lemma will follow if we prove the following two 

statements: 

I. For any flag of subspaces C Vi 2 C - • • C Vi m in C™ , there exist a subordinate set of 
vectors {x ii; x ia , x im }, such that 

m m 

E^ -^E A ^ • 

i=i 3=1 



EIGENVALUES OF SUMS OF PSEUDO-HERMITIAN MATRICES 



9 



II. There exists a flag V n Cl^ 2 C ■•■ C V{ m such that for any subordinate set of vectors 
{x ii; Xj 2 , x im }, one has 

m m 
3=1 3=1 

We will first prove II. Set 

Vy = Span{vi, ViJ , 

where v 1; .., v p G C™ are eigenvectors of A, corresponidng to the eigenvalues Ai, 
A p respectively. Note that Vi, \ {0} C C™. Let {x ix , Xj 2 , x im } be a set of vectors 
subordinate to the chosen flag, and let We be an I- dimensional subspace in their span. 
We know from the classical minimax identities that 

rit < max K PmA p m (x) • 

Note that forx e We, we have 7£p m AP m ( x ) = ^-a( x )- Thus if we let Wi = Spanjx^ , x i2 , ...,x i( ,}, 
then the fact that We C Vi e will imply 

max Ha(x) < max 7^.4 (x) . 

But the maximum in the right-hand side is achieved on the eigenvector Vj f and equals Aj r 
(We recall that the operator yl is trivially Hermitian on the span of its eigenvectors from 
C™.) Thus 

■qe < max 7£p m AP m (x) = max Tl A {x-) < max ^(x) = X ie , 

proving II. 

Now we turn to proving I, by induction on p. Note that for p = 1, the statement 
amounts to showing that 

Ai = min m , 

Vi 

where V\ is a one-dimensional subspace in C™ . This is not hard to establish directly, and 
in any case, is an easy consequence of [21 Proposition 4.1]. 

Now we can take m < p, since in the case when m = p, the statement is again a 
consequence of loc.cit. We consider two subcases: 

1). When % m < p, there exists a (p — l)-dimensional subspace R p -i of C" , containing 
the whole flag C Vi 2 C • ■ ■ C V{ m . Let P v -\ be the operator of projection onto R p -i. 
Consider the pseudo-Hermit ian operator A p _i = P p _iAP p _i, which is actually Hermitian, 
being restricted to R p -\. Clearly for all x e R p -i, one has 7^Ap_i(x) = 7Za (x). If we 
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denote by £1, ..,£ p _i the eigenvalues of v4 p _ 1; in the non- decreasing order, then according 
to loc.cit., one has 

(4.1) &>\i for 1 < i <p- 1 . 

By the inductive hypothesis, for any flag Vi x C Vi 2 C • • • C Vi m in i? n _i, there exists a 
subordinate system of vectors {xj i; X; 2 , x im } such that 

m m 

E^E&; ' 

3=1 3=1 

and we are done in this case. 

2). Now consider the case i m = p. Assume i m = p, i m -\ = p — 1, i m -s = p — s and 
that the number (p — s — 1) is not a part of the m-tuple 1 < z x < i 2 < ■ • ■ < i m < p. Let 
it be the largest remaining element of this m-tuple (the case when there is no such left 
requires only a minor and trivial modification of our discussion). The corresponding flag 
of subspaces now takes the form 

V h C V l2 C • • • C V it C V it+ i c • • • c V p . 

Let Vp- S , Vp_ s +i, v p be the eigenvectors of A corresponding to the s + 1 largest 
eigenvalues. Let R n -i be the subspace of C n spanned by these vectors and containing V it 
and all the Wj-'s. Such a subspace exists since i t < p — s — 2 and thus s+l + it<p — 1. 

Consider yet another flag of subspaces: 

(4.2) ^ t C #p_ s _i c C • • • C i2p_i , 

where Rj = Vj+\ H i? n _i- (In the degenerate case when the dimension of the intersection 
does not drop by 1, we can artificially remove one extra dimension.) 

Again, let us introduce the operator A p _i = P v _xAP p _\ on the space R p -i as before. 
Using our inductive assumption, we can find a subordinate system of vectors 

j x i2 > •••) x it ) x p— s— 1 ) ■••) x p— l} 

such that 

m t p— 1 

+ E - • 

3=1 3=1 j=p-a-l 

where £'s are the eigenvalues of A p _i arranged in the non- decreasing order. According to 
(14. ip . we have 

The vectors v p _ s , v p _ s+1 , v p belong to the subspace R p -i and are eigenvectors for 
A p _i. Thus the corresponding eigenvalues A P _ S ,...,A P are dominated by ^ p _ s _ 1; £ p _i, 
which are the largest (s + 1) eigenvalues of A p _i. Thus we conclude that 

Sii + 6 2 + ' ' ' + 6 t + £p-s + ' ' ' + £p — ^il + + ' ' ' + ^it + ^p-s + ' ' ' + \ 
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Since the system {x^, x^, x^, x p _ s _x, x p _ 1 } is subordinate not only to the orginal 
flag, but also to (14. 2p . we have completed the proof. 
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